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Analyse$  to  dctertnino  the  degroo  to  which  the 
groups  correleted  with  each  other  indicated  similarit/ 
rather  than  difference:  Junior  College  with  undergraduates, 
r ® .87;  Junior  College  with  graduate  students,  r = .75; 
and  undergraduates  with  graduate  students,  r = .91. 

The  difficulty  in  finding  meaning  in  the  pattern 
of  significant  differences,  plus  the  high  correlations 
found  hetwoon  the  rated  means  of  each  group,  indicate  that 
the  differences  could  be  attributed  to  chance.  Another 
interpretation  is  that  other  aspects  of  the  educational 
variable,  not  included  in  this  study,  may  account  for  the 
differences.  However,  this  study  suggested  that  the 
dependent  variables  of  educational  level,  socio-economic 
status,  and  degree  of  authoritarianism  did  not  produce 
significant  differences  among  ratings.  The  overall  dif- 
ferences for  Individual  taped  segments  were  small,  con- 
sidering the  short  training  period.  Although  individual 
raters  varied  to  some  extent,  these  variations  wore  not 
systematic.  The  moan  ratings  for  each  segment  were  in 
close  agreement  with  the  more  extensively  trained  raters 
from  the  University  of  Arkansas  (r  • .77). 

On  the  basis  of  this  study,  it  appears  that  the 
validity  of  comparing  results  between  research  studies, 
when  the  raters  for  each  study  are 


different,  is  supported. 


The  major  implication  is  that  the  large  number  o£ 
research  reports  using  the  Truax  Accurate  Empathy  Scale 
nay  bo  conpared  with  some  confidence.  This  study  also 
implies  that  Che  training  of  raters  is  not  a complicated 
job  requiring  many  hours  or  a highly  skilled  trainer.  The 
present  findings  suggest  that  almost  any  college-level 
group  can  be  quickly  trained  to  rate  levels  of  accurete 
empathy  and  that  such  ratings  can  more  likely  be  compared 
on  an  absolute  as  well  as  a relative  basis. 


CHAPTER 


INTRODUCTION 

An  Oosrviw 

Results  of  research  focusing  on  factors  present 
in  all  instances  of  successful  psychotherapy  have  engen- 
dered a new  spirit  of  inquiry  traiong  practioners  in  the 
fields  of  counseling  and  psychotherapy.  Scales  developed 
by  Truax  for  ntoasuring  these  essential  factors 

of  successful  therapy  are  one  of  the  main  causes  of  this 
new  interest  in  evaluating  therapeutic  encounters.  The 
scales  consist  of  a number  of  progressive  stages,  each 
describing  a more  meaningful  relationship  of  the  therapist 
with  the  client  according  to  the  former's  accurate  empathic 
understanding,  his  nonpossossivo  warmth  and  his  own  autlien- 
ticity  or  genuineness.  The  evaluations  are  obtained  by 
talcing  short  excerpts  from  the  tape-recorded  counseling 
interviews  which  are  then  rated  by  judges  trained  in  the 
use  of  one  or  more  of  the  scales.  The  training  of  judges 
has  varied  from  study  to  study,  from  on-the-spot  training 
to  establish  adequate  interjudge  reliability,  to  highly 
trained  raters  who  may  have  spent  up  to  three  hundred  hours 
learning  to  discriminate  levels  of  accurate  empathy,  non- 


possessive  warmth  or  therapist's  genuineness. 


These  scales  are  being  used  in  an  ever-growing 
nuiuber  of  studies.  Studies  carried  out  under  varying  con- 
ditions and  in  different  areas  of  the  country,  using  dif- 
ferent sets  of  judges,  are  being  conpared  as  if  the  scales 
themselves  were  absolute  ones. 

The  present  study  on  the  Accurate  Empathy  Scale  has 
as  its  principal  aim  the  investigation  of  the  validity  of 
such  comparisons.  Is  it  legitimate  to  compare  two  separate 
studies  which  have  two  different  sets  of  raters  using  the 
same  scale?  One  study  may  claim  that  a therapist  must  score 
at  least  4.5  on  the  Accurate  Empathy  Scale  to  be  considered 
a successful  therapist.  Is  this  true,  then,  for  all  thera- 
pists whose  interviews  are  rated  according  to  this  scale? 

In  other  words,  are  the  rated  scale  values  relative  or 
absolute?  This  study  proposes  that  certain  characteristics 
of  the  raters  will  influence  their  perception  of  empathy  as 
proposed  In  the  descriptive  scales  and  thus  a certain  scale 
value  will  be  interpreted  differently  by  raters  with  dif- 
ferent personality  characteristics. 

According  to  Combs  and  Snygg  (1559,  p.  126ff), 
personal  characteristics  would  be  a very  important  variable 
in  the  subjective  judgment  of  the  raters.  "A  person  behaves 
in  terms  of  his  phenomenal  self."  This  self  is  "essentially 
a social  product  arising  out  of  experience  with  people." 
Thus,  in  Combs'  terms  we  may  say  that  rating  tapes  is  "an 
internal,  individual  phenomenon  and  the  perceptual  field 


of  one  individual  may  be  quite  different  from  that  of  another, 
even  in  what  seems  to  be  the  same  situation." 

The  variables  based  on  education,  environmental 
background  and  personal  authoritarianism,  which  will  be 
discussod  in  the  next  chapter,  were  selected  as  important 
discriminative  characteristics  influencing  the  judgment  of 
racers.  Should  the  raters  in  this  study  differ  signifi- 
cantly according  to  these  variables,  then  the  Accurate 
Empathy  Scale  must  be  considered  a relative  rather  than 
an  absolute  scale,  and  the  cross-comparisons  of  separato 
studies  would  be  invalid. 

In  seeking  answers  to  these  questions,  the  author 
acquired  excerpts  from  professional  counseling  tapes  which 
represented  the  various  stages  of  accurate  empathy.  He 
then  called  upon  on  equal  number  of  Junior  College  under- 
graduate and  graduate  students,  who  were  not  familiar  with 
the  scale,  to  serve  as  raters.  The  groups,  divided  by 
level  of  education,  were  subdivided  by  soci-economic  status 
and  finally  divided  by  high  and  low  level  of  personal 
authoritarianism. 

The  Accurate  Empathy  Scale  was  chosen  because  the 
reliabilities  of  the  scale  have  been  generally  higher  than 
the  other  scales  and  because  it  has  proven  most  highly 
related  to  outcome  (Truax  and  Carkhuff,  1967,  p,  4S). 

Because  of  the  high  reliability  of  the  scale,  deviations 
in  this  study  would  be  more  easily  attributed  to  the 
specified  variables.  Another  reason  for  choosing  this 


stages  would  give  a wider  distribution 


o£  ratings  than  the  five  stages  o£  the  other  two  scales. 

Z’he  Baakground 

"I  never  promised  you  a rose  garden." 

In  Hanna  Green's  book  about  a young  female 
schltophrenic,  the  therapist,  Dr.  Fried,  offers  to  help 
the  teenage  patient  in  her  struggle  to  re-enter  the  real 

She  [Dr.  Fried]  liked  working  with  patients. 

Their  very  illness  made  them  examine  sanity  as  few 
"sane"  people  could.  Kept  from  loving,  sharing  and 
simple  communication,  they  often  hungered  for  it  with 
a purity  of  passion  that  she  saw  as  beautiful  (Green, 

1964,  p.  17). 

The  author  portrays  the  therapist's  deep  awareness 
and  respect  for  the  feelings  and  needs  of  her  clients. 

Such  understanding  has  long  been  recognized  as  beneficial 
by  the  majority  of  therapists.  Attributes  such  as  accurate 
and  sensitive  awareness  of  the  client's  feelings,  genuine 
Interest  in  and  concern  for  his  welfare  without  attempting 
to  direct  his  life  and  honest  communication  of  the  therapist's 
reactions  to  the  client  are  being  considered  more  and  more 
as  the  central  ingredients  of  successful  therapy. 

Many  forms  of  varied  psychological  treatments  have 
been  enthusiastically  acclaimed  as  TBS  therapeutic  approach 
by  their  proponents.  With  few  exceptions,  a particular 
form  of  psychotherapy  lias  not  produced  significant  clianges 


Rogers  (IRS?)  proposed  tliroo  ingredients  which 
he  considered  "necessary  and  sufficient  conditions"  for 
successful  therapy.  They  were  the  therapist's  ability  to 
communicate  eiapathic  understanding  and  unconditional 
positive  regard  for  the  client  and  his  being  a congruent 
or  genuine  person  In  the  relationship.  These  basic  con- 
ditions of  a helping  relationship  may  vary  in  the  degree 
to  which  they  are  emphasized  within  each  approach,  but, 
in  general,  with  differences  in  meanings  of  the  constructs, 
they  aro  common  to  most  theories.  Rogers  and  Truax  (1967, 
p.  100]  suggested  that  genuineness  is  the  oiost  basic 
clement  of  the  three.  A client  must  perceive  his  therapist 
as  real  before  lie  will  experience  warmth  and  respect  of  the 
therapist  as  more  than  just  a technique.  Wliile  the  effcc- 
tiveucss  of  accurate  empathy  may  be  based  on  warmth  and 
genuineness  of  the  therapist,  this  does  not  mean  that  they 
are  stages,  one  built  upon  the  other.  It  is  through  the 
communication  of  ompatliic  understanding  that  the  client 
perceives  the  warmth  and  genuineness  of  the  therapist. 

There  is  a cyclic  development  within  the  relationship: 
the  client  connaunicates  a part  of  himself,  the  therapist 
is  able  to  express  his  empathic  understanding,  genuinely 
and  warmly,  which  in  turn  helps  the  client  to  risk  more 
of  himself.  The  therapist,  too,  with  this  Increasing 
understanding,  finds  it  easier  to  be  in 


the  relationship. 


With  tho  «stabl ishmcut  of  the  constructs  ss  contra! 
ingredients  of  good  therapy,  the  next  step  is  the  specifi- 
cation of  degrees  within  each  construct.  As  rating  scales 
were  developed,  further  modification  of  the  essential 
conditions  became  necessary.  Accuracy  in  empathic  under-  / 
standing  replaced  empathy  as  a construct.  Evidence  sug- 
gested that  "unconditional  positive  regard"  was  less 
related  to  outcome  than  "nonpossessive  warmth."  Also, 
congruence,  which  suggested  a relationship  between  the 
therapist's  organismic  self  and  his  self-concept,  was  more 
effective  when  seen  as  a relationship  with  the  client  in 
tho  sense  of  an  absence  of  defensiveness  or  phoninoss. 

Thus  the  construct,  "genuineness  or  therapist  self-congruence, 
became  the  third  therapeutic  condition  for  which  scales  were 
constructed  (Truax  and  Carkhuff,  1967,  p.  ag). 

The  development  of  reliable  scales,  by  which  short 
excerpts  of  taped  interviews  are  raced,  is  the  breakthrough 
that  tho  field  of  psychothorapy  needed  for  self-evaluation. 
These  scales  measure  three  important  aspects  of  the  psycho- 
therapeutic relationship;  accurate  empathy,  nonpossessive 
warmth,  and  therapist's  self-congruence.  As  Jerome  Prank 
(1967)  stated,  "The  research  findings  [based  on  these 
scales]  resolve  some  of  tho  most  stubborn  research  problems 
in  psychotherapy.  . . . The  impact  of  these  achievements 
may  well  be  revolutionary"  (p.  xl). 


Th%  first  of. these  scales,  the  Accurate  Empathy 
Scale,  developed  by  Trusx  in  1D61,  is  the  object  of  the 
pre.scnt  study.  The  present  study  aims  at  Investigating 
Interrater  variables  which  nay  significantly  influence  the 
absolute  ratings  of  enpathy.  Since  its  inception,  the 
Accurate  Empathy  Scale  has  been  used  in  hundreds  of 
studies.  In  research  using  these  scales,  under  the 
direction  of  their  author,  over  seventeen  thousand  ratings 
were  made.  Each  of  these  ratings  dealing  with  accurate 
empathy  was  based  on  a nine-point  scale  {further  details 
about  the  scales  will  be  found  in  review  of  the  literature) 
Originally  Truax  and  his  confreres  (Rogers,  Gendlin 
Kiesler  and  Truax,  1967,  pp.  54-55)  thought  that  it  would 
require  trained  therapists  to  rate  the  taped  interviews 
since  the  qualities  of  a therapeutic  relationship  were 
considered  so  intangible.  The  researchers  experimented 
with  different  segment  lengths,  even  modifying  the  scales 
as  they  sought  a reliable  scale.  They  soon  realized  that 
to  use  individuals  who  were  sophisticated  in  the  fields 
of  counseling  and  psychotherapy,  they  would  necessarily 
restrict  the  number  of  raters  from  which  they  could  draw. 
They  also  found  that  this  very  hnowlodgoability  of  the 
clinical  process  moved  the  therapists  to  rate  on  other 
than  the  definitions  specified  by  the  scale. 


It  soon  became  evident  that  If  these  qualities  of 
oinpathy.  nonpossossive  warmth  and  therapist's  genuineness 
woro  real  and  recognizable  [and  they  must  be  if  they  are 
truly  facilitating  conditions)  then  they  should  be  real 
and  recognizable  to  any  intelligent  individual  even  without 
any  training  in  therapy.  Also,  the  use  of  naive  raters  has  ! 
boon  studied  and  supported  by  Arnhoff  [1954)  and  Cronbach 
(1960). 

It  was  decided  by  Truax  and  his  colleagues  to 
select  undergraduate  students  as  raters.  The  rating  scales 
were  refined.  (Accurate  Empathy  Scale  is  in  Appendix  C.) 

A specific  definition  was  chosen  for  accurate  empathy  and 
precise  behavioral  terms  were  found  for  describing  each 
level  or  stage.  Students  who  were  looking  for  work  wore 
then  recruited  for  the  study.  The  only  requirements  were 
that  they  have  a B average  or  better  and  be  willing  to 
commit  themselves  for  a long  period  of  work.  Students 
were  then  assigned  to  small  rating  groups  for  particular 
constructs.  They  were  given  brief  but  intensive  instructions 
in  the  meaning  of  the  construct  and  the  scale  which  they 
wore  to  use.  They  wore  also  instructed  not  to  communicate 
with  each  other  about  their  work  until  it  was  completed. 
Although  time  consuming  and  costly,  this  study  had  many 
values.  All  of  the  construct  ratings  were  completely 


independent,  done  by  different  groups  of  Judges  who  had 
no  communication  witli  each  other.  None  of  Che  ratings 
was  biased  by  the  rater’s  point  of  view  of  what  therapists 
ought  to  be,  because  every  rater  was  decidedly  naive  in 
this  respect.  The  researchers  expressed  it  this  way: 

"It  removed  from  our  work  more  completely,  we  believe, 
than  has  ever  been  done  in  any  psychotherapy  research, 
any  possible  element  of  bias.  It  made  it  clear  that 
whatever  findings  emerged  would  have  a solidity  heretofore 
unknown  in  this  field"  (Rogers,  Gendlin,  Kioslor,  and 
Truax,  1967,  p.  56]  . 

From  "hundreds  upon  hundreds"  of  hours  of  taped 
therapy  interviews,  randomized  segments  were  chosen  by 
precise  methods  to  insure  research  objectivity.  Those 
segments,  from  two  to  four  minutes'  duration,  were  rated 
by  the  trained  Judges  according  to  the  nine*point  scale. 
Between  the  years  1961  and  1966,  over  twenty-five  studies 
on  the  reliability  of  the  scales  were  carried  out.  Cor- 
relations of  .81  to  .95  In  twelve  of  the  twenty-eight 
studies  suggest  an  adequate  degree  of  interjudge  relia- 
bility. For  example,  Truax's  first  study,  done  in  1961, 
had  384  samples  of  seven  therapists  with  an  interjudge 
reliability  correlation  of  .87  (Truax  and  Carkhuff,  1967, 
p.  45). 


Importance  of  tha  Problem 


Since  reliability  coefficients  give  only  relative 
comparisons  of  ratings,  the  question  raised  in  this  study 
concerns  the  use  of  such  interjudgo  reliability  coefficients 
as  the  basis  of  comparison  between  two  studies.  Is  the 
scale  an  absolute  one?  That  is,  can  one  study,  whose 
interjudge  reliability  is  high,  be  compared  with  another 
study  with  an  equivalent  coefficient  but  with  different 
judges?  Are  studios  carried  out  in  ono  part  of  the  country 
able  to  be  compared  with  those  from  another?  Is  a 3.5 
rating  on  the  Accurate  Empathy  Scale  from  a study  at  the 
University  of  Arkansas  the  same  as  a 3.5  rating  at  the 
University  of  Florida?  It  nay  be  that  differences  among 
raters  produce  different  levels  of  ratings.  Specifically, 
do  personality,  environmental  and  educational  background 
influence  the  raters? 

One  of  the  reasons  undergraduate  students  were 
chosen  as  judges,  as  has  been  noted,  was  their  lack  of 
knowledge  of  the  clinical  process.  Those  trained  In 
psychotherapy  have  a tendency  to  judge  segments  by  other 
than  the  criteria  specified  by  the  scales.  Admittedly 
then,  the  raters'  evaluations  are  subject  to  variables 
beyond  the  scope  of  the  instrument.  Truax,  in  a current 


rosesrch  study,  has  found  correlations  between  intorjudge 
reliability  and  age  and  certain  personality  factors.  This 
supports  the  idea  that  raters  with  similar  characteristics 
judge  similarly. 


Statement  of  the  Problem 

This  study  examines  whether  raters  tend  to  vary 
their  judgments  according  to  their  disparate  backgrounds. 
Major  aspects  of  background  chosen  for  this  study  are: 
educational  level,  socio-economic  status  and  degree  of 
personal  authoritarianism.  A rater  with  a highly  author-  ^ 
itarian  personality,  from  a low  social  milieu  and  little 
education,  may  tend  to  rate  segments  relatively  lower  than 
his  counterpart  with  opposite  social  and  educational  in- 
fluences. However,  after  a lengthy  training  period,  a 
group  of  raters,  regardless  of  their  backgrounds,  can 
acquire  high  interjudge  reliability.  This  leveling  is  ^ 
seen  as  the  product  of  the  trainer's  "teaching."  The  -wif 
reliability,  then,  of  any  single  study  may  be  said  to  be 
specified  by  the  trainer.  Thus  the  comparison  of  studies 
using  the  Accurate  tmpathy  Scale,  whose  raters  had  boon 
taught  the  scale  by  trainers  with  different  backgrounds, 
would  not  be  valid  regardless  of  the  strength  of  interjudge 
reliability  for  each  study. 


values  are  so  influenced  by  the  personality,  educational 
and  environinental  variables  stated,  that  the  scale  is 
relative  in  itself  and  not  an  absolute  scale. 

In  ostabllshing  the  direction  of  the  hypotheses 
for  educational  level  and  socie-ocononic  status,  consider' 
ation  was  given  te  a deveiopiaental  concept  of  empathy. 

Walton  (1963)  demonstrated  an  increment  in  empathy  as 
children  mature.  This  development  of  empathy  «as  also 
found  by  Carhhuff,  Piaget  and  Pierce  [1938^  who  presented 
evidence  that  empathy  increases  from  the  beginning  of  under- 
graduate college  to  the  beginning  of  graduate  training. 

They  suggest  this  increment  may  bo  accounted  for  by  a 
number  of  variables,  such  as  increasing  lovols  of  experience, 
interest,  motivation  and  sophistication.  These  variables 
are  associated  with  higher  levels  of  education  and  socio- 
economic status.  Thus,  it  may  be  expected  that  graduate 
students  would  have  higher  levels  of  empathy  than  under- 
graduate students,  and  that  undergraduate  students  would 
be  higher  than  Junior  College  students.  The  same  can  be 
said  for  the  socio-economic  status  levels. 

It  is  presumed  that  those  who  possess  higher  levels 
of  empathy  will  be  more  likely  to  identify  them.  Thus,  the 
basis  for  the  hypotheses  on  educational  level  and  socio- 
economic status  is  that  the  higher  the  level  or  status  of 
the  groups,  the  higher  their  mean 


ratings  of  empathy  will  be. 
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In  short,  it  -is  hypothesited  that  tlie  Accurats 
Empathy  Scale  is  a relative  rather  than  an  absolute  scale. 

I£  true,  this  would  invalidate  cross  comparisons  between 
studies  of  absolute  values.  That  is,  results  of  studies 
using  different  sets  of  judges  could  not  be  validly  compared. 

Hepisa  of  the  Literature 

Stewart  (1556)  in  his  booh,  Prefaae  to  empathy, 
proclaims  empathy  os  the  most  important  activity  in  the  life 
of  one  aspiring  to  become  a person.  Aside  from  being  the 
common  ground  for  ethics,  esthetics  and  dynamic  psychology, 
Stewart  further  proposes  that  being  empathic  is  the  essence 
of  interpersonal  relationships. 

How  does  one  define  his  empathic  quality  so  highly 
lauded  by  both  research  and  opinion?  The  description 
"putting  yourself  in  the  other  fellow’s  place,"  is  the 
most  prevalent  definition  of  empathy  found  in  the  literature 
(Dymond,  1950;  Parkorny,  1959;  Halpern,  1955:  Schaefer,  1959). 
Though  such  a definition  may  appear  na^ve,  it  provides  a 
starting  point  for  this  investigation. 

A definition  of  empathy  such  as  the  above  may  be 
appli.ed  to  the  concept  of  identification.  Dymond  (1945), 

Kerr  and  Speroff  (1954)  associate  those  two  concepts  when 
they  propose  that  the  empathiter  identifies  with  the  object. 
Luchins  (1957)  questions  this  postulate  as  he  cannot  see 
the  necessity  of  identification  in  empathic  responses. 
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especially  when  the  object  of  empathy  is  a group.  Dyoiond's 
(1950)  definition  of  empathy  is  "the  imaginative  transposing 
of  oneself  into  the  thinhing,  feeling  and  acting  of  another." 
Kerr  and  Speroff  (1954)  propose  a similar  definition  of  em- 
pathy as  "the  measure  of  ability  to  put  yourself  in  the  other 
person's  position,  establish  rapport  and  anticipate  his 
reactions,  feelings  and  behavior."  Both  definitions  imply 
identification,  but  it  is  a trial  identification.  It  is 
like  the  reader  of  a novel  identifying  with  the  hero;  that 
is,  seeing  the  hero  from  the  hero’s  own  internal  frame  of 
reference,  but  not  losing  his  own  personal  identity  in  the 
process.  "It  is  the  sensing  of  the  (other's]  inner  world 
as  if  it  were  [your]  own,  but  without  losing  the  as  if 
quality"  (Rogers  st  al.j  1967).  Thus,  the  conflict  appears 
to  be  one  of  semantics. 

The  definition  of  empathy  as  tho  process  of  putting 
oneself  in  the  other  person's  place  while  maintaining  one's 
own  identity  is  also  found  in  psychoanalytic  literature. 

Thus,  for  Schaefer  (1959),  empathy  is  "a  sublimated  creative 
act  in  personal  relationships  which  combines  the  gratification 
of  intimate  union  with  the  recognition  and  enhancement  of  sep- 
arateness and  personal  development  of  both  persons  involved." 

Rogers  (1949)  stressed  the  understanding  aspect  of 
empathy.  It  is  the  understanding  of  the  other's  feelings 
and  communications  as  they  seem  to  that  person  at  that 
moment  in  time.  This  understanding  is  not  based  on  logical 
reasoning,  bvtt  on  insight.  Dymond  (194S)  sees  insight  as 


the  understanding  of  the  self-other  patterns  or  roles  which 
the  individual  has  incorporated,  that  which  forms  the  basis 
of  his  expectations  of  others,  his  structuring  of  his  life 
situations  and  the  place  he  feels  he  occupies  within  them. 

Parkorn/  (1959)  a German  psychologist,  regards 
empathy  as  a nonrational  mode  of  understanding.  He  believes 
this  faculty  to  be  characteristic  of  man  throughout  history 
and  that  it  can  be  found  in  primitive  form  in  animals.  This 
nonrational  mode  of  understanding,  according  to  Stewart 
(1956),  is  distinguished  from  -science  but  not  opposed  to  it. 

Sarly  Attempta  at  Maeaurfnp  Empathy 

The  moasuroment  of  empathy  has  tended  to  take  two 
different  approaches,  the  self-other  rating  methods  and 
the  objective  test  methods. 

Salf’Otkep  Ratinge 

One  of  the  first  methods  used  to  measure  empathy 
was  developed  by  Dymond  (1949).  Dymond^s  procedure  is  for 
individuals  to  be  tested  for  empathy  with  each  other  and 
proceeds  as  fallows: 

For  Subject  A: 


himself  (A) 


thinks  B would  rate  himself 


® would 

For  Subjoct  B: 

1.  B rates  himself  fB) 

2.  B rates  A as  he  [B)  secs  him 

3.  B rates  A as  he  thinks  A would  rate  himself 

4.  B rates  himself  (B)  as  he  thinks  A would 

The  measure  of  A's  ompathic  ability  is  obtained 
by  calculating  how  closely  A's  predictions  of  B's  ratings 
correspond  with  B’s  actual  ratings.  Ratings  may  bo  made 
on  any  characteristic,  such  as  sense  of  humor,  selfish-un- 
selfish and  friendly-unfriendly. 

Dymond  (1950),  in  researching  her  self -other  rating 
scale,  used  eighty  college  students  who  became  familiar 
with  each  other  in  groups  of  four  during  ten-minute  sessions. 
They  then  rated  themselves  and  each  other  as  explained  above. 
Those  who  were  most  accurate  and  those  who  were  least  ac- 
curate in  assuming  the  role  of  others  were  further  studied 
to  find  personality  variables  associated  with  empathy. 

The  results  suggested  that  high  empathiters  appeared  to 
bo  well  adjusted  persons  manifesting  such  characteristics 
as  optimism,  flexibility  and  emotional  maturity.  I.ow 
empathiters,  on  the  other  hand,  were  introverted,  emotionally 


liBnature,  sslf-centered  and  had  unsatisfying  early  family 
relationsh ips. 

Daane  and  Schmidt  C19S7}  measured  empathy  In  a 
similar  predictive  manner.  They  asked  counselors  to  predict 
their  clients’  responses  te  stimulus  words.  High  empathy 
scores  appeared  to  bo  positively  related  to  neurotic  and 
psychotic  tendencies  on  the  HHPI  and  negatively  related  to 
scores  in  the  behavior  problems  category. 

ObjeoSiv4  Tsat  Methods 

Kerr  and  SperofC's  (19S4]  empathy  teat  is  the  measure 
of  ability  to  put  yourself  in  the  other  person's  position, 
that  is,  to  establish  rapport  and  anticipate  his  reactions, 
his  feelings  and  behavior.  This  is  a simple  pencil  and 
paper  test  in  which  subjects  arc  asked  to  rank  such  things 
as  the  popularity  of  certain  types  of  music  as  unskilled 
workers  would  rank  them,  or  the  kinds  of  magazines  most 
enjoyed  by  the  average  American.  The  perceived  rankings 
arc  then  compared  with  actual  rankings  and  a score  is  ob- 
tained. According  to  the  Empathy  teat  manual,  various 
studies  have  produced  significant  validation  results.  How- 
ever, Bell  and  Stolper  (1955)  were  unsuccessful  in  their 
attempt  to  validate  the  Empathy  test  through  a replication 
of  the  study.  Also  Strunk  (1957)  points  out  that  attempts 
at  discovering  a positive  relationship  between  the  Empathy 
taat  and  Oymond’s  measurement  scale  of  empathic  ability 
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were  utisucce$sful.  On  two  attempts  correlations  of  r = .02 
and  r ■ -.06  were  obtained. 

Astin  (loss)  did  a comparative  study  of  the 
situational  and  predictive  approaches  to  the  measurement 
of  empathy.  Por  the  situational  test  she  asked  her  sub- 
jects to  respond  to  ten  tape-recorded  client  statements 
as  if  they  were  in  a counseling  Interview.  Their  responses 
were  ranked  by  seven  professional  psychologists  according 
to  the  degree  of  expressed  understanding  of  feeling  and 
content  in  the  client's  statements.  For  the  predictive 
test,  she  had  the  subjects  listen  to  tape  recordings  of 
four  counseling  interviews,  road  the  personal  history  forms 
of  the  four  clients  involved  and  make  predictions  on  what 
the  clients  would  say  on  the  Allport-Vernon-Lindzey  Study 
af  Valuta  test.  There  was  no  significant  correlation 
between  the  two  approaches.  Astin  concluded  that  the 
predictive  approach  was  not  satisfactory. 

Of  all  the  measures  of  empathy  described.  Dymond’s 
scale  uf  empathic  ability  seems  to  have  been  utilized  most 
extensively.  Her  research  has  shown  that  empathic  ability 
can  be  isolated  and  measured  (Strunk,  1957). 

Of  the  approaches  to  the  measurement  of  empathy, 
Astin's  situational  test  most  closely  approximates  Truax's 
accurate  empathy  .ipproach.  The  major  difference  is  in  the 
criteria  for  evaluating  the  empathic  conditions  offered  by 


the  counselors.  Astiii's  raters  were  professional  psycliolo 
gists  who  ordered  the  counselors'  responses  to  taped 
segments.  The  basis  for  judgement  was  the  psychologists' 
own  interpretation  of  empathy.  This  is  in  contrast  to 
Truax's  approach  using  a ninO'Stage  descriptive  scale  with 
trained  raters  who  do  not  have  a background  in  psychology. 

Dymond  and  Kerr  and  Speroff,  on  the  other  hand 
sought  to  evaluate  subjective  empathy  rather  than  empathy 
as  found  in  a relationship  with  others.  Dymond's  scale  of 
cmpathic  ability  and  Kerr  and  Speroff's  Smpathy  teat  rate 
the  accuracy  of  the  subject's  perception  of  Che  other 
person's  feelings  but  not  the  subject's  ability  to  coranuni 
cate  this  perception  to  the  other  person.  Truax's  scale 
rotes  the  ability  to  communicate  empathy  accurately  and 
thus  is  important  as  an  instrument  for  evaluating  the  coun 
seling  relationship. 

Empathy  and  the  Couneeliny  Eelationehip 

Luchins  (185’)  and  Rogers  (1949)  have  shown  the 
importance  of  empathy  in  the  counseling  relntionship. 

Over  the  years  a general  consensus  has  evolved  concerning 
the  importance  of  the  therapist's  ability  to  understand 
sensitively  and  accurately  the  inner  experiences  of  the 
client.  Similarly,  it  lias  been  concluded  that  the  thera- 
pist's maturity  and  genuineness,  and  his  warmth  and 


acceptance  of  tlie  Individual  with  when  he  is  working  are 
also  of  utmost  importance  in  the  therapeutic  relationship. 
Writers  and  researchers  define  these  elements  a little 
differently  depending  upon  their  own  theoretical  orion- 
tatien  (Shoben,  1953;  Bordin,  1955;  Lesser,  1961).  How- 
ever, it  was  Rogers  (1957)  who  presented  these  conditions 
of  empathy,  warmth  and  genuineness  as  "the  necessary  and 
sufficient  conditions  of  therapeutic  personality  change." 

While  the  necessity  or  the  sufficiency  of  these 
three  therapeutic  conditions  for  change,  as  many  writers 
point  out,  may  be  difficult  to  prove  {Ellis,  1959;  Truax, 
1961) , their  importance  as  growth  facilitators  is  not  under 
scored  by  therapists  alone.  Martin  Buber  (1953,  p.  104), 
theologian  and  philosopher,  points  to  the  need  of  the 
"relations  betwocn  the  one  and  the  other"  for  tlie  inmost 
growth  of  the  self."  It  seems  reasonable  to  assume  that 
conditions  that  facilitate  growth  in  the  therapeutic  re- 
lationship within  the  counseling  process  are  the  same  ele- 
ments found  in  all  relationships  that  are  growth  producing. 
On  the  other  side  of  the  coin,  however,  the  relationships 
between  "the  one  and  the  otlier,"  may  be  the  cause  of  deterl 
oration,  Bateson,  Jackson  and  Weakland  (1956)  showed  that 
the  lack  of  tliose  conditions  within  the  home  was  the  major 
cause  of  serious  disturbances  in  children.  The  possibility 


of  such  doterloratini  conditions  being  present  within  the 
counseling  relationship  was  not  thoroughl)'  explored  until 
Eysenck  (1952)  presented  his  disturbing  findings,  lie  con- 
cluded from  his  review  of  nineteen  studies  including  seven 
thousand  cases  that  on  the  average,  psychotherapy  does  not 
result  in  improvemont  greater  than  "no  treatment."  Many 
writers  [Rosentweig,  1954;  Luborsky,  1954)  attacked  the  basis 
on  which  Eysenck  node  his  conclusions,  charging  that  the 
evidence  was  selective,  biased  and  distorted. 

Truax  and  Carkhuff  (1967,  p.  14)  reviewed  the 
research  literature  on  counseling  and  psychotherapy  care- 
fully and  came  to  Che  conclusion  that  unfortunately  Eysenck 
was  correct  in  saying  that  the  average  counseling  and 
psychotherapy  of  the  studies  reviewed  did  not  result  in 
average  client  improvement  greater  than  observed  in  persons 
who  received  no  special  counseling  or  psychotherapy  treat- 
ment. Prank  (1961)  also  reached  this  conclusion,  noting 
that  two-thirds  of  neurotic  patients  are  improved  immediately 
after  treatment  regardless  of  the  type  of  psychotherapy  re- 
ceived. However,  the  same  improvement  rate  was  found  for 
those  who  did  not  receive  treatment.  The  conclusion  (Truax 
and  Carkhuff,  1967;  Truax  and  Mitchell,  1968)  is  that  some 
counselors  aro  significantly  helpful  while  others  are  sig- 
nificantly harmful,  with  a resulting  average  not  demonstrably 
better  than  the  average  of  no  professional  help. 
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One  o£  the  first  studies  of  process  research 
psychotherapy  was  carried  out  by  Whitehorn  and  Betz  (1954). 
They  studied  seven  psychiatrists  with  an  improvement  rate 
of  75  percent  and  an  equal  number  with  an  improvement  rate 
of  27  percent.  Their  evidence  indicated  that  the  successful 
therapists  were  warm  and  attempted  to  understand  the  clients 
in  a personal  way.  The  less  successful  therapists  tended 
to  relate  to  their  clients  in  a more  impersonal  manner, 
focusing  on  a diagnostic,  external  kind  of  understanding. 
Betz's  (1963)  description  of  successful  and  unsuccessful 
therapists  was  consistent  with  the  therapeutic  condition 
presented  by  Rogers  in  his  1957  statement. 

Halkides  (1958)  reported  finding  that  client- 
centered  therapists  who  wore  Judged  most  successful  offered 
their  clients  conditions  rated  higher  in  empathy,  warmth, 
and  genuineness  than  did  those  judged  least  successful. 

The  difficulty  of  rating  these  conditions  is  evidenced  by 
Hart's  (1950)  inability  to  duplicate  the  above  study.  The 
subjective  evaluations  of  racers'  judgments  without  the  aid 
of  precise  scales  rendered  research  in  this  area  difficult 
and  dubious.  There  was  an  evident  need  for  reliable  research 

At  first  the  attempts  to  specify  degrees  of  empathy, 
warmth  and  genuineness  made  "heavy  use  of  the  raters,  who 
were  asked  to  draw  upon  their  own  abundant  experience  of 
being  understood  and  misunderstood,  warmly  and  coldly 
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rocoivGd.  and  met  with  an  artificial  and  a genuine  human 
being"  (Truax  and  Carkhuff,  1967,  p.  43).  The  earl)'  scales 
relied  mainly  on  Rogers’  statements  of  1957.  As  already 
mentioned  in  this  chapter,  research  evidence  prompted  the 
revision  of  the  three  therapeutic  conditions  from  the 
original  "empathy,  unconditional  positive  regard  and  con* 
gruence"  to  "accurate  empathy,  warmth  and  genuineness." 

Research  for  reliable  scales  that  would  specify 
degrees  of  these  three  factors  began  at  the  University  of 
Wisconsin,  continued  at  the  University  of  Kentucky  and 
then  at  the  University  of  Arkansas.  Truax  was  the  director 
of  the  project,  sharing  the  responsibility  for  a time  with 
Rogers,  then  Carkhuff  and  Wargo. 

The  three  ratings  scales  have  not  changed  since 
1962  (Truax  and  Carkhuff,  1967).  In  reviewing  the  devel- 
opment of  the  scales,  primary  attention  will  bo  given  to 
the  Accurate  Hmpathy  Scale  which  is  the  subject  of  the 


Aoeurate  Empathy  Seal« 

The  unique  aspect  of  the  Accurate  Empathy  Scale 
is  that  it  focuses  on  the  responses  rather  than  the  in- 
tention or  attitude  of  the  therapist  and  is  a measurement 
of  his  understanding  of  the  patient  as  indicated  in  these 
responses.  Thus,  accurate  empathy  involves  more  than  the 
therapist’s  ability  to  sensitively  experience  the  private 
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world  o£  the  client.  It  also  means  that  the  therapist 
can  do  this  with  accuracy  and  that  he  can  communicate  this 
understanding  to  the  client  in  terras  that  are  meaningful 
to  him  and  in  tune  with  his  current  feelings  CTruax  and 
Carihuff,  1967). 

To  categorize  the  degrees  of  such  empathic 
understanding,  Truax  defined  nine  degrees  of  accurate 
empathy  (Appendix  C)  beginning  with  an  almost  complete 
lack  of  empathy  and  continuing  to  a level  where  the  thera- 
pist can  he  considered  unerringly  responsive  to  the  client's 
full  range  of  feelings,  even  to  the  point  of  discriminating 
shades  of  emotional  response  and  deeply  hidden  feelings. 

The  application  of  these  scales  to  therapeutic 
interviews  was  carried  out  by  selecting  short  excerpts 
from  tape-recorded  interviews  which  were  then  judged  by 
trained  raters.  The  brief  saiGples  were  usually  of  three 
minutes'  duration;  one  taken  from  the  middle  one-third  of 
the  interview  and  one  sample  from  the  middle  of  the  final 
one-third  (Rogors  at  at.,  1967,  p.  51).  Studies  have  shown 
Chat  these  parts  of  the  therapeutic  sessions  usually  in- 
volve the  least  non-therapy-oriented  interchange  (sample 
segments  are  in  Appendix  D).  In  certain  studies  a re- 
qulraraent  of  at  least  two  therapists'  statements  in  each 
oxcerpt  was  made.  These  excerpts  were  then  arranged  in 
a random  order  before  being  submitted  for  rating. 
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The  procedures  for  training  an  individual  rater 
have  already  been  discussed.  Here  it  should  be  mentioned 
that  individual  raters  would  spend  a number  of  hours,  some- 
times as  many  as  ten,  in  instruction  on  the  construct  of 
empathy  that  he  would  be  rating,  and  in  the  use  of  the 
scale.  He  would  listen  to  segments  of  interviews  which 
had  already  been  rated  by  others  in  order  to  get  a feeling 
for  the  range  of  qualities  of  accurate  empathy  which  he 
would  be  judging,  hlien  he  had  attained  a certain  level  of 
proficiency,  he  was  then  assigned  to  a rater  group,  issued 
a recording  machine  and  earphones  and  put  to  work.  Me  would 
rate  some  given  number  of  interview  segments  in  terms  of  the 
one  quality  for  which  he  had  been  trained  (Rogers,  at  al., 
1967). 

Precautions  were  taken  to  prevent  the  raters  from 
acquiring  knowledge  of  tho  hypothesis  under  study.  Raters 
were  chosen,  as  mentioned,  from  undergraduates  who  were  not 
sophisticated  in  the  field  of  psychology.  This  procedure 
was  followed  to  assure  the  objectivity  of  the  study  and 
that  results  could  be  replicated  in  a future  study. 

Raliability  and  Validity  of  the  Aeeurate  Empathy  Saate 

Interrater  reliabilities  wove  obtained  tor  the 
accurate  empathy,  nonpossessive  warmth,  and  genuineness 
scales  over  the  past  nine  years  (Appendix  F).  The  co- 
efficients for  the  Accurate  Empathy  Scale  are  high,  with 


samples  were  selected  and  rated  as  previously  described. 

The  findings  indicated  that  the  psychotherapists  whoso 
patients  improved  as  rated  by  the  personality  tests  were 
rated  consistently  higher  (p  < ,01)  on  accurate  empathy 
than  those  whose  patients  deteriorated.  Truax  also  notes 
that  the  level  of  the  conditions  offered  by  the  therapists 
did  not  vary  substantially  during  the  six-month  period. 

Later  studies  (Truax.  1963;  Bergin  and  Solomon, 

1963;  Truax.  Wargo,  Frank,  tnher,  Battle,  Ilochn-Saric, 

Nash  and  Stone,  1966a  and  1966b)  also  confirmed  the 
validity  of  the  Instrument,  The  external  criteria  used 
in  these  studies  included  psychological  test  change  data, 
diagnostic  evaluations  of  personality  change,  measure  of 
time  actually  spent  In  the  hospital  since  beginning  therapy, 
and  outcome  as  judged  by  therapists'  supervisors. 

The  Truax  et  (1966a)  study  conducted  at  John 
Hopkins  not  only  suggested  that  patients  receiving  high 
conditions  tended  to  show  positive  change,  but  also  that 
a higher  percentage  of  patients  whose  therapists  furnished 
low  conditions  tended  to  show  negative  change.  Therefore, 
this  study  also  supports  the  allegation  that  psychotherapy 
can  be  "for  better  or  for  worse."  A review  of  outcome 
measures  on  the  therapeutic  effectiveness  of  accurate 
empathy  is  seen  in  Appendix  G. 


Curr&nt  ReBsaFOh 

This  scalo  has  been  usod  for  other  purposes  than 
only  evaluation  of  therapists  and  therapeutic  outcomes. 
Studies  on  teachers'  relations  with  students  (Aspy,  1965; 
Aspy  and  lladlock.  1966;  Coon-Tcters.  1969),  untrained 
community  social  workers  and  counselor  aides  (Stoffer,  1968; 
Carkhuff,  1968;  Carkhuff  and  Truaa,  196Sb).  family  relations 
(Shapiro.  Krauss  and  Truax.  1969),  oven  college  roommate 
influence  (Shapiro  and  Voog,  1969}  have  shown  that  where 
all  three  conditions  and  especially  empathy  are  high,  the 
persons  rated  have  produced  outcomes  significantly  higher 
than  those  whose  conditions  of  empathy,  warmth  and  genu- 

An  area  where  the  scales  have  been  predictive  of 
outcome  is  seen  in  an  unusual  study  by  Shapiro  and  Voog 
(1969).  It  was  found  that  among  resident  college  students, 
the  high  or  low  facilitating  roommate  affected  the  grade- 
point  average  of  his  companion  and  in  the  same  respective 
high  or  low  direction.  Thus,  as  Truax  and  Mitchell  (1970) 
assert  in  reference  to  this  study,  the  inherently  helpful 
person,  as  measured  by  those  scales,  does  influence  others 
in  a positive  way. 

In  concluding  this  review  of  the  literature,  it 
should  be  pointed  out  that  the  rators  tor  all  of  the  above 
studies  wei-e  specially  trained.  The  scales  themselves  were 


considered  coisplicatod  and  thought  to  require  extensive 
training. 

If  raters  with  no  training  were  able  to  correlate 
somewhat  with  trained  raters,  it  may  be  that  these  qualities 
of  empathy,  warmth  and  genuineness  are  such  common  human 
qualities  that  they  can  be  recogniied  by  the  general 
population.  This  would  reduce  the  variability  in  this 
study  so  that  differences  according  to  the  specified 
variables  of  education,  social  status  and  personal  authori- 


accurately  identified. 


CHAPTER 


PROCEDURE 

The  aim  of  this  study,  to  evaluate  the  validity  of 
cross  comparisons  between  studies  of  absolute  values,  re- 
quired accurate  empathy  ratings  of  professional  counseling 
interviews  by  nonprofessional  raters  with  specified  person- 
ality, educational  and  socio-economic  variables. 

The  raters  were  drawn  from  three  levels  of  educational 
attainment:  Junior  College  students;  undergraduate  univer- 
sity students  in  the  College  of  Education;  and  graduate 
students  In  Counselor  Education.  In  all,  ninety-eight 
raters  took  part  in  the  present  rosearch.  Of  the  ninety- 
eight,  eight  subjects  had  to  be  eliminated  from  the  study 
because  they  failed  to  complete  all  sections  on  their  answer 
sheets.  This  loft  thlvty-oight  Junior  College  students, 
twenty-five  undergraduates  and  twenty-seven  graduate  stu- 
dents whose  ratings  and  background  data  were  correctly 
filled  out. 

Raters  for  studies  using  Truax's  scales  have  been 
drawn  predominately  from  students  attending  college.  The 
reason  for  this  is  that  they  are  clinically  naive  and  thus 
they  are  more  llkoly  to  rate  Interviews  on  the  specified 


scale  rather  than  on*other  aspects  of  the  therapeutic 
process — a tendency  found  among  professional  therapists 
(Rogers,  Gendlin,  Kiesler  and  Truax,  1967,  p.  54;  Taft, 
19S5).  Another  reason  is  that  college  students  are 
available  in  large  numbers. 

The  division  of  Junior  College,  undergraduate  and 
graduate  students  seeks  to  establish  points  along  the 
dimensions  of  educational  levels.  Since  most  judges  in 
studies  completed  to  date  are  drawn  from  these  categories, 
this  range  of  educational  attainment  is  particularly  relO' 


The  backgrounds  of  the  raters  used  in  this  study 
represented  all  areas  of  the  United  States,  and  six  raters 
wore  from  foreign  countries.  At  the  time  of  the  study, 
they  were  oil  students  in  Gainesville,  Florida.  The  raters 
wore  students  from  Santa  Fe  Junior  College,  undergraduate 
students  in. the  School  of  Education  and  graduate  students 
in  Counselor  Education  at  the  University  of  Florida. 

The  raters  were  not  drawn  from  random  samples;  but 
rather  for  the  sake  of  convenience  whole  classes  were  asked 
to  participate. 

There  were  twenty-four  malos  and  forty-eight  females. 
The  proportions  for  the  respective  groups  were  as  follows: 


35  - 


Junior  College  students — fourteen  males  to  ten  females; 
undergraduates — tvo  males  to  twenty-two  females;  and 
graduate  students — eight  males  to  sixteen  females. 

The  overall  mean  age  of  the  raters  was  24.3  years. 
The  mean  age  for  each  respective  group  was  as  follows; 
Junior  College  students. — 25.5  years;  undergraduates — 20.9 
years;  and  graduate  students — 26.5  years.  The  moan  ago 
of  Junior  College  students  is  high  because  there  were  two 
students  in  their  late  thirties  and  two  in  their  fifties. 

The  research  design  for  this  study  called  for 
twenty-four  in  each  of  the  three  educational  status  cate- 
gories. The  levels  of  educational  status  wore  subdivided 
by  high  and  low  socio-economic  status,  which  in  turn  was 
subdivided  into  high  and  low  personal  authoritarianism 
(Table  1).  These  divisions  will  be  discussed  later.  In 
selecting  seventy-two  subjects  from  the  total  of  ninety 
raters,  fourteen  Junior  College  students,  ono  undergraduate 
and  three  graduate  students  were  dropped  from  the  study. 
This  was  done  by  eliminating  those  subjects  whose  personal 
authoritarianism  scoreswere  closest  to  the  cutoff  point 
between  high  and  low.  The  resulting  division  of  cells  is 


Table  1. 


TABLE  1 

Division  of  Raters 


Socio-economic  Authoritarian  F Scale 

Subjects  N Status  N N X SD 


Professional 

Junior  College  24 


10  Hi 
Lo 


Hi  9 *15 


Professional 


Graduate 

Students 


Professional 


Total  Hi 
Total  Lo 


?»TeoTialit\j  Variables 


The  measurement  of  variables  that  might  Influence 
ratings  was  considered  from  throe  aspects  based  on  the 
moasurement  of  conditions  ^'hich  form  personality,  and  the 
measurement  of  a specific  personality  variable  considered 
most  likely  to  have  an  effect  on  ratings.  Environmental 
background  as  measured  by  educational  and  socio-economic 
status  fulfilled  the  former  requirement.  The  latter 
variable  chosen  was  rigidity  of  personality  or  authori- 
tarianism. 


Saaio-SooaoKio  Claasifiaatioe 

Shartle  (1959)  notes  that  the  social-economic 
groupings  as  seen  in  the  Diotionarj^  of  oaoupational  titles 
are  useful  for  comparison  between  populations.  The  envi- 
ronmental variations  that  accompany  socio-economic  dif- 
ferences affect  the  formation  of  personality,  because 
personality  characteristics  are  predominately  products 
arising  from  empariences  with  people  (Combs  and  Snygg,  1959, 
p.  125  £f).  Thus  socio-economic  status  was  chosen  as  the 
variable  that  might  affect  absolute  level  of  ratings. 

Socio-economic  status  was  classified  according  to 
Edward's  socio-economic  grouping  prepared  for  the  Bureau 


of  Census  (Shartle,  1959,  p.  143). 
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CO  D.O.T.  Group  1 8 
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with  "Professionals."  Contrary  to  expectation  the  sample 
contained  a preponderance  of  subjects  where  families  wore 
in  the  "Professional"  group;  therefore,  the  division  was 
made  between  "Professional"  and  "Other."  The  breakdown  of 
socio-economic  class  of  those  subjects  used  in  the  study 
is  seen  in  Table  2. 

In  choosing  a personality  variable  likely  to  influence 
ratings,  rigidity  of  personality  was  considered  to  be  the 
most  significant.  The  rigid  person  is  one  who  finds  if 
difficult  to  accept  other  people's  values  which  may  con- 
flict with  his  own.  This  rigidity  often  is  monifestod  in 
the  condemnation  of  others  who  do  not  accept  the  conventional 
values  which  the  rigid,  authoritarian  person  accepts  uncrit- 
ically. As  Adorno  (1950,  p.  255)  points  out,  the  authori- 
tarian personality  finds  it  difficult  to  understand  another's 
personal  feelings.  Thus,  it  was  anticipated  that  high  levels 
of  such  dimensions  would  influence  the  authoritarian  person's 

Adorno's  anti-democratic  F scale  was  chosen  as  a 
measure  of  rigidity  of  personality  or  authoritarianism 
(Appendix  A).  Prom  the  earlier  studies  on  antisemitism, 

Adorno  came  to  the  conclusion  that  "authoritarianism 


;n!s;sr;?  Hi 


may  be  interpreted  to  mean  that  the  scale  can  place 
individuals  along  a dimension  with  a small  margin  of  error. 
The  validity  of  this  instrument  is  based  on  two  case  studies. 

Of  the  ninety  subjects  originally  involved  in  the 
study,  Junior  College  students  registered  over  a wide  range 
on  Adorno's  Authoritarian  F Scale  with  mostly  "highs"  and 
few  "lows"  as  compared  with  the  undergraduates  and  graduate 
students.  The  diagram  in  Table  3 shows  the  number  of  raters 
who  scored  within  each  ten  points  on  the  Authoritarian  F 

This  wide  range  of  authoritarianism  was  divided  into 
"high"  and  "low"  categories.  The  cell  numbers  with  means 
and  standard  deviations  are  as  shown  previously  in  Table  1. 

The  raters  filled  out  Adorno's  Authoritarian  F 
Scale  after  completing  the  evaluation  of  the  taped  segments. 

Tpuax  Aaaurata  Empathy  Scats 

The  historical  bnchground  and  development  of  the 
Accurate  Empathy  Scale  has  already  been  discussed  in  the 
previous  chapter.  It  will  suffice  at  this  point  to  give 
a brief  description  of  the  scale  Itself. 

Truax's  nine-point  scale  of  accurate  empathy 
ranges  from  a lo\?  point  where  the  counselor  manifests  a 
complete  lack  of  empathic  understanding,  almost  to  the 


point  of  treating  the  client  as  a thing  without  foollngs 
(stage  1),  to  a high  point  (stage  9)  where  the  therapist 
responds  accurately  and  unhesitatingly  to  the  universe  of 
client's  expressed,  iinplied  and  even  preconscious  feelings. 
As  Kiesler  explains, 

a high  level  of  accurate  empathy  requires  in  effect 
that  the  therapist  understand  the  patient's  phenomenal 
world,  integrate  this  material  fully  into  his  own  frame 
of  reference,  and  communicate  this  integration  to  the 
patient.  This  is  not  to  say  that  the  therapist  must 
respond  with  the  same  intensity  or  range  as  the  patient 
does,  but  that  he  Incorporate  the  patient's  feelings 
into  his  own  awareness.  At  a high  level  of  accurate 
empathy  the  message  "I  am  with  you"  is  unmistakenly 
clear — the  therapist's  remarks  fit  compatibly  with 
the  patient's  mood  and  content  (Kiesler  ae  aZ., 

1967,  p.  139). 

Melloh  (1964)  in  his  schematic  presentation  of  the 
scale  (Appendix  6)  seems  to  go  beyond  Kiesler  et  at. 'a 
statement.  The  high-level  therapist  perceives  and  reflects 
not  only  present  obvious  and  veiled  feelings  but  even 
preconscious  feelings.  Meiloh's  schematic  presentation 
doscribes  accurate  empathy  on  three  levels:  present 
obvious  feelings,  veiled  feelings  and  preconscious  feelings. 
The  first  level  discriminates  between  stages  1 to  5,  the 
second,  which  overlaps  the  first,  is  useful  in  evaluations 
from  stage  3 to  stage  7.  Tlic  third  level,  preconscious 
foolings,  covers  stages  7,  8 and  9.  As  Truax  points  out  I 
(1967,  p.  59),  this  schematic  presentation  "has  been  found  \ 
useful  foi'  both  resoarch  raters  and  therapist  trainees. 


It  provides  a brief  Summary  of  the  table  scale,  and  is 
intended  to  facilitate  the  training  of  raters  in  the 
USB  of  the  scales 

RataT  Training 

To  allow  full  reign  for  the  effects  of  educational 
level,  socio-economic  status,  and  personal  authoritari- 
anism, to  influence  absolute  levels  of  empathy  ratings, 
only  a brief  thirty-minute  period  of  training  was  used  j 
In  the  present  study. 

The  present  procedure  is  in  contrast  to  the 
rater-training  in  previous  studies.  Ordinarily,  raters 
have  been  trained  over  a longer  period  of  time,  up  to 
three  hundred  hours.  After  becoming  familiar  with  the 
descriptive  stages  of  the  constructs,  each  rater  was 
assigned  to  rate  practice  materials  under  the  supervision 
of  the  trainer.  In  prior  studies,  only  when  Interratcr 
correlations  were  considered  adequate  were  the  raters 
assigned  to  a specific  study. 

for  this  study,  however,  Helloh's  schematic 
breakdown  was  used  in  explaining  the  scales  to  the  raters. 
It  was  decided  that  Truax's  verbal  description  of  the 
scale  (Appendix  C)  would  be  too  thorough  for  the  raters 
to  absorb  adequately  in  the  thirty-minute  training  period 
allowed.  To  Cacilitate  the  use  of  Melloh's  schema,  the 


summary  of  the  scale -was  placed  at  the  top  of  the  evalu- 
ation sheet  with  stages  ranging  from  1 to  9 corresponding 
with  each  segment  [Appendix  DJ . The  raters  were  instructed 
to  study  the  diagrammatic  ratings , choose  one  and  circle 
the  score  on  the  line  corrosponding  to  the  specific  taped 
segment. 

The  trainer  explained  the  use  of  the  scale  to  the 
raters,  using  a structured  format  that  could  be  easily 
repeated.  The  raters  wore  encouraged  to  use  the  scale  as 
a continuum  and  place  a check  mark  at  any  point  on  a line 
marked  off  between  1 and  9.  Each  period  between  the  digits 
on  the  continuum  represented  tenths  of  a scale  point,  so 
the  scale  in  effect  was  a 90-point  scale. 

The  tapes  were  rated  and  the  raters  trained  in 
groups  of  six  to  thirty.  The  standard  explanation  of  the 
scale  was  accompanied  by  a demonstration  of  three  taped 
segment  samples.  After  each  sample  segment  the  raters 
discussed  their  evaluations.  No  effort  was  made  to 
establish  a correct  score,  or  to  enforce  the  majority 
opinion  on  all.  Instead  the  scale  as  it  was  interproted 
by  each  rater  was  the  only  guideline. 

The  trainer  allowed  for  questions  on  his  explanation 
of  the  scale  before  playing  the  eighteen  interview  segments 
used  for  rating  accurate  empathy  in  the  present  study. 

There  was  a five  minute  rest  period  after  the  ninth 
interview  h.sd  been  evaluated. 
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To  develop  standardized  procedures  for  raters' 
training  and  ratings,  the  entire  procedure  was  pretested 
on  a group  of  eight  graduate  students  (not  included  in  the 
study).  Minor  modifications  were  made  in  the  training 
method  at  graduate  students'  suggestion. 

Sfleatian  of  Taped  Segments 

Taped  segments  used  in  this  study  came  from  two 
sources  Csaraple  segments  ore  in  Apendix  E) . Seventeen 
taped  segments  of  professional  therapeutic  interviews 
from  the  University  of  Arkansas  were  reviewed.  These 
tapes  were  three-minute  random  samples  of  interviews 
which  had  been  rated  by  six  sets  of  qualified  raters. 

Of  the  seventeen  taped  segments,  six  poorly 
recorded  ones  were  eliminated  because  of  difficulty  in 
hearing  the  therapist  and  client  responses.  The 
remaining  eleven  were  rated  as:  one  segment  at  stage  2; 
three  segments  at  stage  S;  two  segments  at  stage  4;  one 
at  stage  Sj  two  at  stage  6;  one  at  stage  8;  and  one  at 

Seven  of  these  segments  were  from  the  National 
Institute  of  Mental  Health  at  the  University  of  Arkansas 
and  were  taken  from  interviews  ranging  from  the  third  to 
the  eleventh  session.  The  remaining  four  segments  from 
this  source  were  from  a vatcr-training  tape  used  at  the 


research  center. 


The  remaining  seven  segments  were  chosen  from 
interviews  conducted  by  counselors  - in-training  at  the 
University  of  Toronto.  They  too  were  selected  by  the 
researcher  to  represent  a wide  range  on  the  Accurate 
Empathy  Scale,  but  had  not  bean  rated  according  to  the 

There  were  fourteen  male  therapists  and  four 
female.  Three  of  the  female  therapists  had  male  clients. 
Of  the  six  female  clients  only  one  had  a female  therapist. 

Details  for  each  segment  are  given  in  Table  4. 

All  the  segments  represented  personal  counseling.  The 
last  seven  excerpts  were  from  interviews  with  high  school 
students  and  were  centered  around  personal  problems 
pertaining  to  scliool.  The  problems  varied  in  their 
intensity.  Degrees  of  intensity  are  presented  in  Table 
4 as:  very  serious  problem  with  emotional  presentation, 

4;  very  serious  content  but  not  emotionally  presented,  Sj 
content  serious,  Z and  content  not  serious,  1. 

Other  Variablea 

Besides  the  educational  and  personality  variables 
previously  described,  rating  differences  caused  by  sex, 
source  of  support  (i.e.,  father,  mother  or  guardian^  and 
age  wore  computed.  Each  of  these  factors  might  affect 
personality  and  thus  the  absolute  judgment  of  tlic  raters. 


2 

4 

6 

7 

9 

10 

14 

15 

16 

17 


TABLE 


Types  o£  Taped  Segments 


Origin  Sex  of  Sex  of 
of  Tape  Therapist  Client 


NIMH  M M 

Arkansas 

NIMH  M F 

Arkansas 

NIMH  M H 

Arkansas 

NIMH  M F 

Arkansas 

NllUI  M M 

Arkansas 

Arkansas 


Arkansas 

Arkansas  M 

Training 

Arkansas  M 

Training 

Arkansas  F 

Training 

Arkansas  M 

Training 

Toronto  F 

Toronto  M 

Toronto  M 

Toronto  M 

Toronto  M 

Toronto  H 

Toronto  F 


Tape  Intensity 
Rating  of  Problem 


The  importance  of  sex  and  age  differences  on  personality 
is  generally  accepted.  Personality  tests  are  developed 
with  this  in  mind.  Thus  in  interpreting  raw  scores  of 
personality  tests,  allowance  is  made  for  differences  in 
sex  and  ago.  Source  of  support  is  analogous  to  socio- 
economic groupings  but  not  included  in  them.  The  effect 
of  a fatherless  childhood  environment  on  the  personality 
of  raters  was  chosen  as  a variable  to  be  studied. 


CHAPTBR  III 


RESULTS 

This  study  primarily  examined  the  influence  of 
educational  status,  socio-economic  class  and  degree  of 
personal  authoritarianism  on  the  ratings  nouprofessional 
judges  gave  to  eighteen  taped  three-minute  segments  of 
professional  counseling  interviews.  The  judges  were 
trained  in  the  use  of  the  Truax  Accurate  Empathy  Scale 
which  was  used  in  evaluations  of  the  segments. 

The  methods  used  In  training  the  raters  and  the 
manner  in  which  the  data  were  collected  have  been 
discussed  in  the  previous  chapter.  Each  rater  only  had 
a thirty-minute  briefing  to  familiarito  himself  with  the 
scale.  The  eighteen  segments  were  then  presented  for 
ratings. 

AnalyaiB  of  Data 

The  research  design  (Figure  1)  forms  a 3 « 2 a 2 
factorial  design  with  eighteen  repeated  measures  with 
subjects  nested  within  oach  of  the  cells.  The  interactions 
(Figure  2— Partition  of  Sum  of  Squares)  study  seven 
possible  interactions  of  main  variables  which  may  have 
an  influence  on  the  raters.  The  means  and  standard 

each  of  the  taped  segments 


deviations  wore  computed  fc 


of  the  combined  groups,  and  of  each  category  according 
to  education,  socio-economic  status  and  degree  of 
au thoritarian ism. 

A one-way  analysis  of  variance  was  computed  for 
sex  differences  [Male:  24,  Female;  47)  and  source  of 
support  (Father:  61,  Mother:  8.  and  Guardian:  3). 

The  final  computation  was  a correlation  between 
age  and  total  judges’  score  to  see  if  there  is  a relation- 
ship between  chronological  maturity  and  segment  evaluations. 

Data  Validity 

For  each  of  the  eighteen  repeated  measures  (taped 
segments),  the  standard  deviations  of  the  judged  values 
varied  from  1.23  to  1.92  on  the  nine-point  scale  (Table  S). 
Eleven  of  the  eighteen  taped  segments  had  been  prerated  by 
qualified  judges  at  the  University  of  Arkansas.  Corre- 
lation of  these  eleven  ratings  between  the  standard  raters 
at  Arkansas  and  the  mean  scores  for  the  same  tapes  by  the 
raters  in  the  present  study  was  r = .77  (Figure  3). 

Bypothaaet 


Educational  Statua 

Differences  among  the  means  of  educational 
status  on  ratings  of  taped  segments  were  significant  at 
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TAPES  1-18  Junior  College  with  r ■ .87 

Undergraduates 

Junior  College  with  r = .75 

Graduates 

Undergraduates  with  r ■ .51 

Graduates 


TAPES  1-11  Professional  Judges  with  r = .77 

Combined  Groups 

Professional  Judges  with  r = .80 

Junior  College 

Professional  .Judges  with  r =■  .70 

Undergraduates 

Professional  Judges  with  r = .77 

Graduates 


Figure  J.  Intergroup  Correlations 


the  .05  level  CTable  6).  The  first  hypothesis  predicted 
differences  among  means,  but  with  graduate  students  being 
significantly  higher  than  undergraduate  students,  and 
undergraduates  significantly  higher  than  Junior  Collego 
students.  However,  the  results  indicated  that  under- 
graduates' overall  mean  of  5.03  was  higher  than  the 
graduates'  mean  of  i.?!  and  the  Junior  College  students' 
mean  of  4.47. 

"Scheffd's  comparisons"  were  computed  to  determine 
where  significant  differences  were  indicated.  A comparison 
of  the  mean  of  undergraduates  with  the  mean  of  Junior 
College  students  (which  seemed  to  be  whore  the  differences 
occurred)  resulted  in  no  significant  differences  (F  ■ 4,67, 
p > .05).  The  only  significant  difference  occurred  in  the 
comparison  between  differences  in  means  of  the  under- 
graduates, versus  a combination  of  Junior  College  and 
graduate  students  (F  ■ 7,24,  p < .05). 

Correlations  of  mean  values  of  the  eighteen  taped 
segments  wore  computed  between  each  education  variable 
(Figure  5).  The  highest  correlation  was  between  under- 
graduates and  graduate  students  (r  ■ .91),  while  Junior 
Collego  students  and  undergraduates  were  next  (r  " .87), 
and  Junior  College  and  graduate  students  were  the  farthest 
apart  (r  = .75) . 


TABLE  6 
Summary  Table 


Education 
Soclo-ec anomic 
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589S  1 707 

1874  2 1631 

77  2 2089 

7029  1 95 

2 1440 


5,2353  tp  < -OS) 


1 « 2 X 3 2,880 

Raters/groups  55,373.97  (60)  922.8995 
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7437  1.76 

8850  2.0644 

7029 

0333  1.56 
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Mean  values  for  Individual  taped  segments,  as 
seen  in  Table  5,  vary.  Undergraduates  gave  higher  values 
on  the  average  than  the  other  educational  groups.  ^Junior 
College  X = 4.42;  Undergraduates  X = 5.03;  Graduate 
students  X =•  4.73.J 

Correlations  of  the  mean  values  for  each  educational 
group,  between  the  first  eleven  taped  segments  and  the 
segments  judged  by  standard  raters  at  the  University  of 
Arkansas,  were;  Junior  College  r - .80;  Undergraduates 
r ■ .70;  Graduate  students  r = ,77.  Although  they  varied 
to  some  extent  in  absolute  ratings,  their  relative  ratings 
were  in  close  agreement  with  more  extensively  trained 
standard  raters  ^Figure  3). 

Variability  within  each  educational  group  can  be 
seen  from  the  mean  standard  deviations  of  each  group: 

Junior  College  S.D.  ■ 1.54;  Undergr.sduates  S.D.  ” 1,49; 
Graduates  S.D,  * 1,51. 

No  differences  in  rating  were  obtained  between 
high  and  low  Socio-Economic  Status  (F  “ < 1;  Table  O'), 
"Professional"  group  ratings  on  each  segment  had 
variability  scores  ranging  from  S.D.  " 1.09  to  S.D.  = 2.01 
with  a mean  S.D.  » 1,56  CTable  7j  . "Other"  group  ratings 
on  each  segment  ranged  from  S.D.  = 1.30  to  S.D.  = 1.93  with 
a mean  S.D.  ° 1 .55. 


TABLE  7 


Moans  and  Standard  Deviations  of  Taped  Segments 
According  to  Socio-Economic  Status 


"Profess  Lonal " 
Socio-Economic  Status 


Tape  No.  Mean  S.D, 


Total 


1.45 

7 • 1.56 


Socio-Economic  Status 


10 

12 

13 


16 


5.34 

5.34 

3.07 

3.08 


1.38 

1.49 


1.30 

1.59 

1.41 

1.93 


1.49 

1.42 

1.52 


1.33 

1.63 

y = 1 
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TABLE  8 

Mesns  and  Standard  Deviations  of  Taped  Se|iiicnts 
According  to  High  and  Lo«  Autlioritarianism 


High  Authoritarlanlsra 


10 

11 

12 

13 

15 

16 

18 

Total 


S.SO 

4.81 

6.47 

4.71 

4.S6 

5.53 

X ■ 4.95 


1.97 

1.54 

1.37 
1.36 
1.56 
1.04 

1.38 


X = 1.50 


Authoritavianiem  f Scale 

Differences  in  the  degree  of  personal  authori- 
tarianism also  causod  no  significant  differences: 

F = < 1 (Table  6).  Standard  deviations  for  the  high 
authoritarian  raters  varied  from  5.D.  - 1.04  to  5.D. 

= 1.97  with  a mean  S.D.  " 1.50.  Low  authoritarian  raters 
ranged  from  S.D.  = 1.30  to  S.D.  ■ 2.21  with  an  average 
S.D.  = 1.S9  (Table  80 . 

To  test  the  effect  of  extreme  differences  in 
personal  authoritarianism  or  rigidity  of  personality  on 
raters,  an  analysis  of  variance  using  total  raw  scores 
was  computed  between  the  nine  least  and  the  nine  most 
authoritarian.  The  F ratio  was  less  than  1 and  thus  not 
significant  (Table  9). 

TABLE  9 

Analysis  of  Variance  CDxtrcmo  Authoritarian  Groups) 


High  Authoritarian  9 

Low  Authoritarian  9 

Sum  of  Square  df 

Between  Group  1820.06  1 

hithin  Group  320293.56  16 

Total  322113.62  17 


1820.06  0.09 

20018.34 


(F  . 2.06,  p !■  .OSJ, 


(Table  6). 
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There  were  twenty-four  inales  and  twenty-seven  females. 

F ratio  was  3.3204.  approaching  but  not  reaching  signifi- 
cance at  .05  level  {Table  10). 

father.  Mother  or  Guardian  a>  Means  of  Support 

A one-way  analysis  of  variance  was  used  to 
establish  any  influence  traceable  to  differences  in  means 
of  support.  Sixty-one  of  the  raters  reported  "Father”  as 
the  breadwinner,  eight  considered  "Mother"  their  source 
of  financial  support  and  three  were  supported  by  guardians. 
The  F ratio  was  1.42  and  not  significant  at  .05  level 
(Table  11). 

The  influence  of  age  on  overall  ratings  of  taped 
segments  was  computed.  Raters’  ages  were  correlated  with 
total  rating  values  on  all  eighteen  taped  segments.  A 
nonsignificant  minus  correlation  of  r = -0.14  was  obtained. 


CHAPTER 


SUMMARY,  CONCLUSIONS.  AND  IMPLICATIONS 
Summavy 

The  principal  aim  of  this  study  was  to  investigate 
specified  rater  variables  that  might  influence  rater  inter- 
pretation of  the  Accurate  Empathy  Scale.  Thus  the  objective 
was  to  determine  the  validity  of  comparing  results  from 
separate  studies  utilizing  the  Accurate  Empathy  Scale 
when  the  raters  for  each  study  were  different.  In  other 
words,  is  the  Accurate  Empathy  Scale  relative  or  absolute? 

It  is  a reasonable  supposition  that  certain 
personality  and  other  variables  would  affect  the  ratings. 
Since  a long  period  of  training  would  tend  to  minimize 
these  differences,  the  present  study  used  only  a thirty- 
minute  period  of  training.  If  personality,  socio-economic, 
educational,  or  other  variables  affect  the  absolute  values 
of  the  scale,  then  ratings  could  not  be  validly  compared 
from  study  to  study. 

Thus,  tho  present  study  used  a factorial  analysis 
design  to  investigate  rater  variables  which  may  signifi- 
cantly influence  the  absolute  ratings  of  empathy.  Signifi- 
cant differences  at  the  .05  level  were  predicted.  The 


variables  that  were  judgail  to  be  the  nost  influential  in 
this  study  were  educational  atcainmont , environmental 
background  and  degree  of  personal  authoritarianism. 
Raters  were  drawn  from  Junior  College,  university  under- 
graduate and  graduate  students.  There  was  equal  repre- 
sentation froffl  each  educational  level.  Tho  raters  were 
then  subdivided  by  environmental  background  (socio- 
economic high  and  low)  and  high  and  low  on  the  Authori- 


All  tho  raters  wore  previously  unfamiliar  with 
the  nine-stage  Accurate  Ismpathy  Scale.  In  groups  of  six 
to  thirty,  they  wore  introduced  to  the  scale.  A thirty- 
minute  familiarication  period  using  a structured  format 
describing  each  stage  was  all  the  training  they  received. 

Eighteen,  three-minute,  audio-taped  excerpts  from 
professional  counseling  and  psychotherapy  interviews  were 
then  evaluated  for  accurate  empathy  by  the  seventy-two 
raters.  After  completing  the  evaluations,  the  raters 
indicated  tho  socio-economic  status  of  their  families 
and  responded  to  Adorno’s  authoritarian  statements  from 
which  their  positions  on  the  F scale  were  computed. 

Bypathgaeo 

this  study  examined  three  major  hypotheses  and 
three  additional  variables  aimed  at  discovering  if  rater 


lieaulte 

This  study  hypothesized  differences  sjaong  rating 
means  for  all  three  categories;  educational  attainment, 
socio-economic  status  and  degree  of  porsonai  authoritarian- 

Hypothesis  I predicted  the  differences  on  the 
educational  variable  would  be  in  the  direction  of  the 
graduate  students  evaluating  highest,  followed  by  under- 
graduates and  the  Junior  College  students.  Although 
computations  indicated  a significant  difference  (Tabic  6), 
tests  to  establish  where  the  differences  occurred  showed 
that  there  woro  no  pair-wiso  differences  among  the  groups. 
However, "Scheffd's  comparison^' of  the  means  of  the  under- 
graduates versus  a combination  of  the  means  of  the  Junior 
College  and  graduate  students  indicated  significant  dif- 
ferences (F  « 7.24,  p < .05).  One  explanation  for  these 
differences  could  be  the  effect  of  rater  age.  The  mean 
egos  for  tho  groups  (Figure  J)  shows  the  graduate  students 
(26.5)  as  oldest,  followed  by  Junior  College  (25. S)  and 
undergraduate  students  (20).  The  mean  ages  for  tho  Junior 
College  and  graduate  students  approximate  each  other,  with 
the  undergraduate  students  significantly  younger,  thus  those 
difforonces  could  be  the  effect  of  age.  If  the  professional 
judge  ratings  are  accepted  as  tho  criterion  of  rating 
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accuracy,  Chen  It  can  be  noted  that  the  youngest  group 
(undergraduate  students)  has  the  lowest  correlation  with 
the  professional  judges  (r  • .70).  The  other  two  groups 
of  raters  agree  more  closely  with  the  professional  judges; 
graduate  students,  r « .77;  Junior  College  students, 
r " .80.  Thus,  the  groups  with  the  older  mean  ages  would 
seem  to  be  more  accurato  in  their  ratings.  It  must  be 
noted,  however,  that  tho  computation  of  the  influence  of 
age  on  overall  ratings  of  taped  segments  produced  a non- 
significant correlation  of  -0.14. 

Since  these  differences  between  the  means  of  the 
undergraduate  students  versus  tho  means  of  tho  Junior 
College  and  graduate  students  combined  wore  only  at  the 
.05  level  and  were  not  indicated  in  the  predicted  direction, 
a further  analysis  was  made  to  establish  the  degree  to 
which  the  groups  correlated  with  each  other.  Correlations 
of  means  of  the  eighteen  taped  segments  were  computed 
between  each  educational  level;  Junior  College  with 
undergraduates  r » .87;  Junior  College  with  graduate 
students  r = .75;  undergraduates  with  graduate  students 
r ■ .91. 

The  lack  of  significant  differences  in  the  predicted 
direction  plus  the  high  correlations  that  were  found 
between  tho  rated  means  suggests  that  the  differences 
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may  be  attributed  to  chance.  In  any  event  hypothesis  I 
must  bo  rejected  in  as  much  as  there  is  no  directional 
significant  difference  between  the  means  of  graduate,  under- 
graduate, and  Junior  College  students  on  their  ratings. 

Mypotiiesis  II  predicted  that  the  mean  ratings  of 
"Professional”  socio-economic  status  raters  would  be 
significantly  higher  than  the  "Other."  There  were  no 
significant  differencos  indicated  and  thus  the  hypothesis 
docs  not  stand. 

hypothesis  III,  that  differences  in  the  degree  of 
personal  authoritarianism  would  show  those  scoring  high 
on  tho  Authoritarian  P Scale  to  have  low  mean  ratings, 
while  those  with  low  authoritarian  scores  would  rate 
higher,  must  be  rejected  since  there  were  no  differences 
indicated. 

It  seemed  reasonable  that  the  rigid  authoritarian 
person,  who  shows  little  understanding  for  others,  would 
not  perceive  empathy  in  others  os  well  as  the  non-authori- 
tarian typo.  To  check  this  thoroughly,  an  analysis  of 
variance  was  computed  between  the  authoritarian  extremes: 
the  nine  most  authoritarian  and  the  nine  least  authori- 
tarian. There  was  no  significant  difference  Cf  * ^ l)i 
thus  confirming  the  findings  that  ratings  are  not  affected 
by  this  personality  characteristic. 
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Other  Anati/aeB 

Sex  differences  approached  the  level  of  signifi' 
cancc  (F  ” 3,52),  but  source  of  support  was  not  signifi- 
cant. A correlation  of  raters'  ages  with  total  judged 
values  produced  a nonsignificant  minus  correlation  of 
r ■ - .14. 

Limitatione  of  the  Study 

The  degree  to  which  the  variables  investigated 
can  be  said  to  apply  to  all  levels  of  education,  socio- 
economic status  and  personal  authoritarianism  is  limited 
by  the  narrow  range  of  all  of  these  variables  in  the 
subjects  for  this  study.  The  variable,  educational 
level  of  attainment,  extends  from  Junior  College  to 
graduate  students — it  does  not  include  those  below 
Junior  College  level'  nor  the  vast  range  of  intellectual 
attainment  beyond  the  university  graduate  level.  The 
socio-economic  variable’s  division  of  "Professional," 
and  "OElier"  was  not  an  adequate  division  of  high  and 
low  since  there  were  a number  in  the  "Other"  category 
who  were  of  "Manager  end  Proprietor"  status.  The 
personal  autlioritarianlsm  variable  was  divided  into 
high  and  low  by  the  median.  Thus  there  was  not  an 
adequate  separation  between  high  and  low.  Kowever,  a 


second  analysis  of  variance,  using  the  oxtreioes  of  high 
and  low,  suggests  this  variable  is  adequately  represented. 

Conotusions 

The  only  significant  differences  in  the  mean 
ratings  of  tapes  were  between  the  educational  groups. 

As  indicated,  these  differences  were  between  the  under- 
graduates and  the  combined  groups  of  Junior  College  and 
graduate  students.  Although  these  differences  do  not 
seen  to  follow  any  pattern  that  would  make  then  meaningful, 
they  should  not  be  Ignored.  This  putzling  pattern  leaves 
open  the  possibility  that  aspects  of  a variable  due  to 
education,  other  than  the  division  of  Junior  College, 
undergraduate  and  graduate  students,  nay  neaningfully 
account  for  the  differences.  If  such  wore  the  case,  the 
scale  would  then  be  considered  relative  and  dependent  on 
such  a variable. 

However,  this  study  suggests  that  the  chosen 
variables  did  not  produce  absolute  differences  among 
ratings  which  wore  interpretablo.  The  overall  differences 
for  individual  taped  segnents  were  snail  considering  the 
short  training  period  (Table  S),  Although  individual 
raters  varied  to  some  extent  in  absolute  ratings,  they 


unrelated 


hypolhi; 


the  relative  ratings 


were  in  close  agreement  with  the  more  extensively  trained 
raters  from  the  University  of  Arkansas  (r  ■ .77J. 

Since  the  other  variables  of  age,  sex  and  source 
of  support  also  were  not  significant.  It  may  bo  reasonable 
to  suggest  that  the  rating  scale  is  not  influenced  by 
those  extrinsic  variables  at  least  within  the  narrow 
range  of  these  subjects  or  raters.  Khile  this  study  did 
not  consider  ail  possible  variables  that  could  affect 
ratings,  it  did  deal  with  those  which  were  considered 
most  likely  to  influence  the  ratings.  If  variables,  such 
as  the  ones  used  in  this  study,  do  not  affect  ratings,  then 
the  scale  Itself  may  be  considered  absolute  and  applicable 
to  disparate  studies  using  different  sets  of  judges  with 
some  assursnee  that  these  studies  nay  bo  compared  on  the 
basis  of  the  absolute  ratings  given.  Thus  the  validity 
of  cross  comparisons  between  studies  of  absolute  values, 
contrary  to  expectation,  is  not  sucCD5S_fuILy  challenged 
by  this  study. 

Implieations 

The  major  implication  of  this  study  is  that  the 
large  number  of  research  reports  using  the  scale  for 
accurate  empathy  may  be  compared  with  some  confidence. 

This  generalitation  of  absolute  levels  means  that 


standards  of  proficiency,  such  as  the  Diinimal  rating  for 
successful  therapy  of  4.S,  can  be  applied  with  a high 
degree  of  confidence. 

' This  study  also  inplies  that  the  training  of 

^raters  is  not  a complicated  job  requiring  many  hours  or 
a highly  skilled  trainer.  The  present  findings  suggest 
that  aliaost  any  college-level  group  of  people  can  be 
trained  quickly  to  rate  levels  of  accurate  empathy  and 
tliat  such  ratings  more  likely  can  be  compared  on  an 
absolute  as  well  as  a relative  basis. 

It  is  also  implied  that  levels  of  accurate  empathy 
have  been  so  specified  that  the  descriptive  stage  are 
easily  recognizable  by  individual  raters.  It  seems 
that  individuals,  regardless  of  tholr  degree  of  authori- 
tarianism, socio-oconoiaic  status  or  level  of  educational 
attainment,  know  and  can  recognize  empathy  even  though 
they  may  not  be  able  to  offer  the  conditions  themselves. 

It  seems  we  can  recognize  an  understanding  relationship 

Impli>7ation&  foi>  Counselor  Sdueatian 

The  ease  with  which  the  scale  can  be  taught  has 
implications  for  counselor  education.  Counselors -in- 
training  would  he  able  to  evaluate  each  other's  practicum 
tapes,  and  expecislly  their  own  interviews.  Thus  the 


counselor  vho  has  learned  to  use  these  scales  with 
confidence  has  a tool  for  self-evaluation  which  he  will 
carry  with  him  after  graduation. 

The  use  of  the  scales  for  empathy  "training,"  as  / 
has  been  suggested  by  Truax  and  Carkhuff  (19f7,  p.  293), 
is  much  more  feasible  considering  the  implications  of  this 
study.  The  counsolor-in- training  having  been  taught  how 
to  discriminate  the  various  levels  of  ompathic  responses, 
attempts  such  responses  himself  in  role-playing  situations. 
If  the  role  playing  has  been  recorded,  the  responses  can 
then  bo  evaluated  by  his  peers  as  well  as  by  him.  An  added 
value  of  this  approach  is  the  motivation  for  change  evoked 
by  the  personal  satisfaction  in  learning  and  the  approval 
of  fellow  students  and  supervisor. 

If  this  study  may  be  said  to  support  the  validity 
of  the  Accui'ate  Empathy  Scale,  then  more  consideration 
should  be  given  to  the  research  on  selection  of  counselor 
candidates  (Carkhuff,  Piaget  and  Pierce,  1968)  and  of 
training  faculty  (Pierce,  Carkhuff  and  Beronson,  1967) 
based  on  this  scale.  Evidence  indicates  that  just  as 
clients  converge  on  the  level  of  erapathic  functioning  of 
their  counselors,  so  do  counselors- in- training  converge 
on  the  level  of  functioning  of  their  trainers.  If  coun- 
seling may  bo  "for  bettor  or  for  worse"  (Lister,  1970), 
counselor  education  may  also  be  "for  better  or  for  worse." 


Subpeofeeaional  Counselcra 

Many  of  the  raters  used  in  this  study  were  Junior 
College  students.  The  fact  that  their  ratings  had  a higher 
correlation  with  the  standard  ratings  than  those  of  the 
other  educational  groups,  suggests  that  the  degree  of 
eiDpathic  awareness  is  independent  of  the  level  of  education. 
Many  of  these  students  will  not  go  on  to  a university  or 
graduate  school.  The  possibility  of  using  such  people  as 
subprofossional  counselors  has  boon  proposed  and  evaluated 
by  a number  of  experts  (Truax  and  Duncan,  1969;  Stoffer, 
1968;  Carkhuff,  1968). 

There  is  a need  for  inherently  helpful  people 
without  special  training  in  interpersonal  relations. 
Jourard's  (1968)  concepts  of  "healers,*'  those  who  habit- 
ually help,  and  "witches"  those  who  habitually  hurt  people 
they  interact  with,  indicate  a need  for  screening  in  em- 
ploying people  as,  for  example,  "house  parents"  In  training 
schools  for  the  physically  handicapped,  or  the  emotionally 
disturbed,  etc.  (Stoffer,  1968).  bach  candidate  could  be 
asked  to  interview  a student.  This  study  Implies  that  a 
selection  board  could  be  easily  trained  in  Che  use  of  the 
scales,  and,  thus,  could  evaluate  such  an  interview. 

If/rpZioatiuna  fop  Feseapoh  in  CounnaZing 

The  fact  that  those  scales  have  been  vised  In  cross 
comparisons  of  separate  studies  indicates  that  many 
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researchers  and  theorists  believe  them  to  be  absolute 
CLister,  1970;  Carkhuff,  Piaget,  Pierce,  1968).  This 
study  implies  that  these  cross  comparisons  may  be  accepted 
with  more  confidence  in  the  validity  of  such  comparisons. 

That  the  training  of  raters  for  research  purposes  1 
is  not  complicated  and  can  be  done  with  a lainimum  of  time 
and  effort  may  be  one  of  the  most  significant  implications  ( 
of  this  study.  The  selection  and  training  of  expert  raters 
could  begin  with  the  type  of  training  used  in  this  study. 

/Rater-candidates  would  be  given  the  brief  explanation  of 
/ the  scale.  Those  whose  ratings  correlate  with  the  standard 
ratings  would  then  be  given  further  training  and.  finally, 
divided  into  groups  as  sets  of  research  raters,  each  group 
having  established  its  interjudgo  reliability  coefficient. 

The  two  major  implications  of  this  study  have  been: 
first,  that  rese.irch  reports  using  the  Accurate  Empathy 
Scale  may  be  compared  with  some  confidence;  and,  second, 
that  the  training  of  raters  is  not  a complicated  task  re- 
quiring many  hours  of  training  and  a highly  skilled  rater- 
trainer,  hut  that  almost  any  group  of  college-age  people 
can  be  trained  in  the  use  of  the  scale  by  someone  who  has 
studied  the  dimensions  and  is  able  to  teach  them, 
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25.  The  businsssnan  and  the  manufacturer  are  much  more 

Important  to  society  than  the  artist  and  the  professor. 

25,  No  sane,  normal,  decent  person  could  ever  think  of 
hurting  a Close  friend  or  relative. 

27,  Familiarity  breeds  contempt. 

25.  Nobody  over  learned  anything  really  important  except 
through  suffering. 
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APPENDIX  C 


TRUAX’S  SCALE  POR  THE  MEASUREMENT  OF  ACCURATE  EMPATHY 


Therapist  seems  completely  unaware  of  oven  the 
most  conspicuous  of  the  client's  feelings;  his  responses 
are  not  appropriate  to  the  mood  and  content  of  the  client's 
statements.  There  is  no  determinable  quality  of  empathy 
and  hence  no  accuracy  whatsoever.  The  therapist  may  be 
bored  and  disinterested  or  actively  offering  advice,  hut 
he  is  not  communicating  an  awareness  of  the  client's 
feelings . 

Therapist  shows  an  almost  negligible  degree  of 
accuracy  in  his  responses  and  that  only  toward  the  client's 
most  obvious  feelings.  Any  emotions  which  are  not  clearly 
defined  he  tends  to  ignore  altogether.  He  may  be  correctly 
sensitive  to  obvious  feelings  and  yet  misunderstand  much 
of  what  the  client  is  really  trying  1;o  say.  By  his  response 
he  nay  block  off  or  may  misdirect  the  patient.  Stage  2 is 
distinguishable  from  Stage  3 in  that  the  therapist  ignores 
feelings  rather  than  displaying  an  inability  to  understand 


Therapist  often  responds  accurately  to  client's 
more  exposed  feelings.  He  also  displays  concern  for  deeper. 


- 89  - 

more  Hidden  feelings,  vhich  he  seems  to  sense  must  be 
present  though  he  dees  not  understand  their  nature  or  sense 
their  meaning  to  the  patient. 

Therapist  usually  responds  accurately  to  the 
client's  mere  obvious  feelings  and  occasionally  recognizes 
some  that  are  less  apparent.  In  the  process  of  this 
tentative  probing  however,  he  may  misinterpret  some  present 
feelings  and  anticipate  some  which  are  not  current.  Sen- 
sitivity and  awareness  do  exist  in  the  therapist,  but  he 
is  not  entirely  "with"  the  patient  in  the  aurrent  situation 
or  experience.  The  desire  and  effort  to  understand  are 
both  present  but  his  accuracy  is  low.  This  stage  is 
distinguishable  from  Stage  S in  that  the  therapist  does 
occasionally  recognize  less  apparent  feelings.  He  also 
may  seem  to  have  a theory  about  the.  patient  and  may  even 
know  how  or  why  the  patient  feels  a particular  way,  but 
ho  is  definitely  not  "with"  the  patient.  In  short,  the 
therapist  may  be  diagnostically  accurate,  but  not  em- 
pathetically  accurate  In  his  sensitivity  to  the  patient's 
current  feelings. 

Therapist  accurately  responds  to  all  of  the  client's 
more  readily  discernible  feelings.  He  also  shows  awareness 


of  many  loss  evident  feelings  and  experiences,  but  be  tends 
to  be  somewhat  Inaccurate  in  his  understanding  of  those. 
However,  when  he  does  not  understand  completely,  this  lack 
of  complete  understanding  is  communicated  without  an  antic- 
ipatory or  jarring  note.  His  misunderstandings  are  not 
disruptive  by  their  tentative  nature.  Sometimes  in  Stage  S 
the  therapist  simply  communicates  his  awareness  of  the 
problem  of  understanding  another  person's  world.  This 
stage  is  the  midpoint  of  the  continuum  of  accurate  empathy. 

Therapist  recognizes  most  of  the  client’s  present 
feelings,  including  those  which  are  not  readily  apparent. 
Although  he  understands  their  content,  he  sometimes  tends 
to  misjudge  the  intensity  of  these  veiled  feelings,  so 
that  his  responses  aro  not  always  accurately  suited  to 
the  exact  mood  of  tlie  client.  The  therapist  does  deal 
directly  with  the  feelings  the  patient  is  currently  ex- 
periencing although  he  may  misjudge  the  Intensity  of  those 
less  apparent.  Although  sensing  the  feelings,  he  often  is 
unable  to  communicate  meaning  to  them.  In  contrast  to 
Stage  7,  the  therapist’s  statements  contain  an  almost 
static  quality  in  the  sense  that  he  handles  those  feelings 
that  the  patient  offers  but  does  not  bring  new  elements 
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to  life.  He  is  ’'with"tho  cliont  but  does  not  encourage 
exploration.  His  manner  of  communicating  his  understanding 
is  such  that  he  makes  of  it  a finished  thing. 

Therapist  responds  accurately  to  most  of  the  client's 
present  feelings  and  shows  awareness  of  the  precise  intensity 
of  most  of  the  underlying  emotions.  However,  his  responses 
move  only  slightly  beyond  the  client's  own  awareness,  so 
that  feelings  may  bo  present  which  neither  the  client  nor 
therapist  recognizes.  The  therapist  initiates  moves  toward 
more  emotionally  laden  material,  and  may  communicate  simply 
that  he  and  the  patient  are  moving  toward  more  emotionally 
significant  material.  Stage  7 is  distinguishable  from 
Stage  6 in  that  often  the  therapist's  response  is  a kind 
of  precise  pointing  of  the  finger  toward  emotionally  sig- 
nificant material. 

Therapist  accurately  interprets  all  the  client's 
present,  acknowledged  feelings.  He  also  uncovers  the  most 
deeply  shrouded  of  the  client's  feelings,  voicing  meanings 
in  the  client's  experience  of  which  the  client  is  scarcely 
aware.  Since  the  therapist  must  ncce.ssarlly  utilize  a 
method  of  trial  and  error  in  the  new  uncharted  areas. 
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are  minor  flaws  in  the  accuracy  of  his  understanding,  but 
those  inaccuracies  are  held  tentatively,  With  sensitivity 
and  accuracy  he  moves  into  feelings  and  experiences  that 
the  client  has  only  hinted  at.  The  therapist  offers  specific 
explanations  or  additions  to  the  patient’s  understanding  so 
that  underlying  emotions  are  both  pointed  out  and  specif- 
ically talked  about.  The  content  that  comes  to  life  is 
now  but  it  is  not  alien. 

Although  the  therapist  in  Stage  8 makes  mistakes, 
these  mistakes  are  not  jarring  because  they  arc  covered 
by  the  tentative  character  of  the  response.  Also,  this 
therapist  is  sensitive  to  his  mistakes  and  quickly  changes 
his  response  in  midstream,  indicating  that  he  has  recognised 
what  is  being  talked  about  and  what  the  patient  is  seeking 
in  his  own  explorations.  The  therapist  reflects  a together- 
ness with  the  patient  in  tentative  trial  and  error  exploration. 
His  voice  tone  reflects  the  seriousness  and  depth  of  his 
cmpathic  grasp. 

The  therapist  in  this  stage  unerringly  responds  to 
the  clioni’s  full  range  of  feelings  in  their  exact  intensity. 
Without  hesitation  he  recognizes  each  emotional  nuance  and 
communicates  an  understanding  of  every  deepest  feeling.  He 


is  complotely  attuned  to  the  client's  shifting  emotional 
content;  ho  senses  each  of  the  client’s  feelings  and 
reflects  them  in  his  words  and  voice.  With  sensitive 
accuracy,  he  expands  the  client’s  hints  into  a full- 
scale  (though  tentative)  elaboration  of  feeling  or  expe- 
rience. He  shows  precision  both  in  understanding  and  in 
communication  of  this  understanding,  and  expresses  and 
experiences  them  without  hesitancy. 
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Client:  We  never  gut 


Yeah,  that's  it.  It's  like  that  all  the  timel 
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Um  hum. 

i f.;.1 ■“ 

Well,  um  huml 


But  I think  it  is  worth  the  effort.  And  no  use 
Uh  huh;  uh  huh. 


Iildjli  i I 


That's  right.  Well,  that's  just  it,  I when  I think 
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RELIABILITIES  OF  R.ATING  SCALES  FOR  ACCURATE 
EMPATHY,  NONPOSSESSIVE  WARMTH,  AND 
GENUINENESS  FROM  SPECIFIC  STUDIES 
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FINDINGS  ON  THE  THERAPEUTIC  EFFECTIVENESS 
OF  ACCURATE  EMPATHY 
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